Standard cell library having cell drive strengths selected according to delay

ABSTRACT

A cell library which enables reduced quantization over-design in large scale circuit design is provided. Library cells having the same cell function have drive strengths selected to provide delays about equal to a predetermined set of design delays, at a nominal load corresponding to the cell function. In contrast, conventional cell libraries typically have drive strengths which correspond to a predetermined set of cell physical areas. Preferably, the spacing between adjacent design delays is a non-decreasing function of cell drive strength. Such spacing reduces quantization induced over-design compared to conventional cell libraries which have a design delay spacing that is a decreasing function of cell drive strength.

FIELD OF THE INVENTION

This invention relates to the automatic design of large scale circuits.

BACKGROUND

Design of large scale electrical circuits is frequently automated by provision of a library of standard cells for performing various circuit functions. In typical large scale circuits (also referred to as VLSI circuits), standard cell circuitry typically occupies from 50% to 70% of the circuit area, with the remainder being memory. Most of the power consumption (both switching power and leakage power) is in the standard cell circuitry.

Cell library functions can include, for example, logic functions such as AND2 (a 2-input AND gate) and OR4 (a 4 input OR gate) and switching functions such as DFF (a D flip-flop). Inverters, NAND gates and NOR gates are also typically included in cell libraries. A standard cell library typically provides multiple cells having the same cell function (e.g., AND2) and differing in drive strength (e.g., AND1×2, AND2×2, etc.). Cells having higher drive strength generally consume more electrical power, but can be used to drive a larger load, or can be used to improve circuit speed.

For example, FIG. 1 schematically shows cell delay as a function of load for a set of cells performing the same function but having different drive strengths, labeled ×1, ×2, ×3, and ×4. By increasing drive strength for a fixed delay, as shown by line 102, a larger load can be driven. By increasing drive strength for a fixed load, as shown by line 104, delay can be reduced. Thus, provision of several cells having different drive strength for performing a certain function provides design flexibility in the cell library. Indeed, a cell-level design is largely a determination of which drive strength to use for each cell function required in a circuit.

Large scale circuit design is frequently formulated as a minimization of power consumption subject to constraints on circuit-level delay, which lead ultimately to constraints on cell-level delay. The relation between circuit-level delay and cell-level delay is generally complex, and is usually accounted for by an automated design tool used in the design process. One example of the complexity in relating cell-level delay to circuit-level delay is that increasing the drive strength of a particular cell Z decreases its delay, but tends to increase the load on the cell(s) Y providing input to cell Z. The increased load on cell(s) Y tends to increase their delay.

For the purposes of this description, “delay” can be a rise delay or a fall delay, or any combination thereof (e.g., an average of rise delay and fall delay). A delay can also be a switching time, or any other cell timing parameter which decreases as cell speed increases. Delays can be state-dependent (e.g., a delay from input A to output Z can depend on the state of a second input B).

For convenience in cell library design and cell layout, the drive strengths for each cell function are usually selected to provide a predetermined scaling of total transistor active area within a cell. For example, the ×2 cell typically has twice the transistor active area of the ×1 cell (with similar scaling for the other drive strengths). The ×2 cell is also often designed to have twice the physical area of the ×1 cell (also with similar scaling for the other drive strengths), in order to simplify cell layout. For example, if the ×1 cell is regarded as a “brick”, then cell layout is simplified if all the larger cells have the configuration of 2 or more adjacent “bricks”. An example of such a configuration is schematically shown on FIGS. 2 a and 2 b, corresponding to an ×1 and an ×2 cell respectively. On FIG. 2 a, a cell 202 includes a transistor having a gate contact 206 between a source 204 and a drain 208. FIG. 2 b shows a cell 210 that is twice as large as cell 202, and includes a transistor having a gate contact 214 between a source 212 and a drain 216. The width of the transistor of cell 210 is twice the width of the transistor of cell 202, and thus has twice the active area. For cells having multiple transistors, typically all of the transistors in the cell are scaled together to provide the various drive strengths.

However, the conventional approach to providing cells having varying drive strength described above suffers from a notable drawback, in that significant cell over-design often occurs in practice. This drawback is best appreciated in connection with FIG. 3, which shows a typical distribution of cell drive strength in a large scale circuit. A noteworthy feature of FIG. 3 is that the number of cells is a steeply decreasing function of drive strength. Although the relation between circuit level delay and cell level delay is complex, as indicated above, the design automation tool is ultimately faced with a requirement to select a cell from a finite set of cells having different drive strengths, and usually selects the smallest possible cell to minimize power.

This quantization (or granularity) of cell drive strengths inherently leads to over-design. For example, if a delay corresponding to a drive strength of ×1.1 is required, and the choices are ×1 and ×2, ×2 will be chosen in order to meet the requirement. Similarly, if a drive strength of ×1.9 is required, and the choices are between ×1 and ×2, ×2 will be chosen. In the latter case, the over-design entailed by use of ×2 where ×1.9 would suffice is much less than in the former case, where ×2 is used where ×1.1 would suffice. On FIG. 3, the number of cells is a steeply decreasing function of drive strength, and it is therefore likely that most of the ×2 cells are in fact significantly over-designed (i.e., more like the ×1.1 example above than the ×1.9 example above).

In the example where only an ×1.1 cell was needed and an ×2 cell had to be selected due to drive strength quantization, power consumption is unnecessarily increased by the difference in power consumption between an ×2 cell and an ×1.1 cell. Both switching power and leakage power are undesirably increased by such quantization. Some known design approaches inherently avoid this quantization problem, by reliance on continuous scaling of cell drive strength and/or transistor size during design. These approaches also have their drawbacks. More particularly, such approaches tend to complicate the design process and undesirably increase design time. In other words, the advantage in design simplicity offered by cell library design is partially (or even completely) lost because of further optimization required after the cell level design is complete.

For example, U.S. Pat. No. 4,827,428 considers a method for design optimization where it is assumed that transistor size (i.e., drive strength) can be continuously varied. While such an approach inherently avoids over-design due to quantization, the assumed continuous scalability of transistor sizes is also inherently much more complicated than design with standard library cells having quantized drive strengths.

It should also be noted that many prior art references are concerned with aspects of large scale circuit design independent from the over-design problem identified above. This is not surprising, since large scale circuit design is highly complex, and can therefore be approached from many different and unrelated viewpoints. For example, U.S. Pat. No. 5,724,250 considers detailed methods and algorithms for efficient cell substitution of library cells having different drive strength in a circuit design. Such substitution algorithms do not address the quantization over-design issue identified above. As another example, U.S. Pat. No. 6,496,965 considers provision of variable drive strength cells by automatically wiring 2 or more standard cells together in parallel. While this is an alternative to providing ×1, ×2, etc. cells in the library, wiring cells together in parallel does not address the quantization over-design issue identified above.

Another approach is considered in U.S. Pat. No. 5,633,805, where a cell library having a two-dimensional cell sizing progression is considered, where minimum load and maximum load are treated as independent variables. In U.S. Pat. No. 5,598,347, cell libraries having cells with different drive strength but the same width are considered. Similarly, U.S. Pat. No. 5,663,662 considers a cell library having cells with different drive strength but the same physical area and terminal locations. These three approaches are also concerned with providing solutions to design problems other than the above-identified quantization over-design issue.

Accordingly, it would be an advance in the art to provide a cell library enabling reduced quantization over-design in cell level design. It would also be an advance in the art to provide such reduced over-design without adding significant complexity to the overall circuit design process.

SUMMARY

The present invention provides a cell library which enables reduced quantization induced over-design in large scale circuit design. Library cells having the same cell function have drive strengths selected to provide delays about equal to a predetermined set of design delays, at a nominal load corresponding to the cell function. In contrast, conventional cell libraries typically have drive strengths which correspond to a predetermined set of cell physical areas. Preferably, the spacing between adjacent design delays is a non-decreasing function of cell drive strength. Such spacing reduces quantization induced over-design compared to conventional cell libraries which have a design delay spacing that is a decreasing function of cell drive strength.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows delay vs. load for cell library cells having different drive strength and performing the same cell function.

FIGS. 2 a and 2 b show an ×1 cell and an ×2 cell respectively.

FIG. 3 shows a typical distribution of cell drive strengths in a large scale circuit design using a conventional cell library.

FIG. 4 shows delay vs. load for library cells in accordance with an embodiment of the invention.

FIG. 5 shows a distribution of cell drive strengths expected in a large scale circuit design using a cell library according to an embodiment of the invention.

FIG. 6 a shows delay vs. drive strength at a nominal load.

FIG. 6 b shows delay vs. drive strength for library cells in accordance with an embodiment of the invention.

FIGS. 7 a and 7 b show an ×0.5 cell and an ×1.5 cell in accordance with a preferred embodiment of the invention.

DETAILED DESCRIPTION

FIG. 4 shows delay vs. load for library cells in accordance with an embodiment of the invention. On FIG. 4, a nominal cell function load 420 is shown by a vertical dashed line. A set of predetermined design delays, shown as 402, 404, 406, and 408 in this example, is defined on FIG. 4. Various methods for defining these design delays will be discussed below. According to the invention, drive strengths for cells performing the same cell function are selected such that cell delays at the nominal load 420 are about equal to the design delays 402, 404, 406, and 408. In other words, cell drive strengths are selected such that the corresponding delay vs. load curves 410, 412, 414, and 416 intersect the nominal load line 420 at or near points 402, 404, 406, and 408 respectively. In the example of FIG. 4, it is assumed that the drive strengths corresponding to curves 410, 412, 414, and 416 are ×0.7, ×1.0, ×1.2, and ×1.45 respectively. Here, and throughout this description, the X designation for drive strength is intended for comparison with conventional usage. Thus an ×1.2 cell has 1.2 times the active area of an ×1 cell, and ×0.8 cell has 0.8 times the active area of an ×1 cell, etc.

In the example of FIG. 4, the design delays 402, 404, 406, and 408 are about evenly spaced, which tends to reduce quantization induced over-design. For example, if a cell needs slightly more than an ×1 drive strength, the next available drive strength is ×1.2. Indeed, if a cell requires less than ×1 drive strength, the ×0.7 library cell may suffice. In either case, significant reduction of power consumption is provided by the availability of such library cells, compared to conventional libraries having ×1, ×2, etc. cells. Since time delay is what typically constrains the selection of drive strength, provision of library cells having delays equal to predetermined design delays (i.e., according to the invention) provides a more appropriate set of cell drive strengths than provided by conventional libraries.

For example, referring back to FIG. 1, note that the delay difference at load 104 is larger between the ×1 and ×2 cells than it is between the ×2 and ×3 cells (or between any other two adjacent cells). Thus conventional cell libraries tend to have the largest delay difference between the two most commonly used cells in typical designs (as indicated in FIG. 3). This unfortunate characteristic of conventional cell libraries leads to significant over-design in practice, since most of the cells that need more than an ×1 drive strength need only slightly more than an ×1 drive strength, but the next available drive strength is ×2.

Although the example of FIG. 4 shows evenly spaced predetermined design delays, alternative embodiments are possible where the design delays are more closely spaced for cells having lower drive strength (which are typically more commonly used) than for cells having higher drive strength (which are typically less commonly used).

It is helpful to define the design delay spacing of a cell X as the delay difference at the nominal load between cell X and the cell having the next largest drive strength above that of cell X. Conventional cell libraries provide a design delay spacing that is a decreasing function of cell drive strength, and so the delay spacing is largest for cells having the lowest drive strength (typically the most commonly used cells). Preferable embodiments of the invention, including both alternatives considered above, provide a design delay spacing that is a non-decreasing function of cell drive strength. Such provision of minimal delay spacing for cells having the lowest drive strength is a key advantage of the invention, since the low drive strength cells are most commonly used in typical designs.

The selection of the exact value to use for nominal load 420 is not crucial for practicing the invention. Instead, a key idea of the invention is to size the library cells in accordance with delay at a given nominal load, as opposed to the conventional and arbitrary scaling of cell active area. Thus the exact value used for the nominal load 420 is not especially significant. In many cases, nominal load 420 can be a typical ×1 load, since ×1 loads are representative loads for a large fraction of cells in typical designs. This is to be expected, since automatic design tools tend to place connected cells close to each other to reduce delay, and the resulting short connections between cells typically do not add significant load.

Selection of cell drive strength according to delay at a nominal load will tend to provide cells having drive strengths which are more evenly utilized in practice. For example, FIG. 5 shows an example of a distribution of cell drive strengths expected in a large scale circuit design using a cell library according to an embodiment of the invention. Certain advantages of the invention can be appreciated by comparing FIG. 5 to FIG. 3.

On FIG. 3, the ×2 cell is used much less often than the ×1 cell, the ×3 cell is used much less often than the ×2 cell, etc. Even though the library of FIG. 3 has six cell drive strengths, these six cell drive strengths are not equally useful, since most of them are hardly utilized at all in typical designs. In sharp contrast, the example of FIG. 5 shows six drive strengths, of which four are significantly utilized in a typical design. Quantization induced over-design is reduced in the embodiment of FIG. 5 compared to the conventional approach of FIG. 3, because the drive strengths of FIG. 5 have improved resolution where it matters (i.e., for cells which are more frequently used in typical designs).

Although conventional cell sizing (e.g., ×1, ×2, ×3, ×4, ×8, and ×16) may appear to provide finer drive strength resolution for small drive strengths than for large drive strengths (e.g., the difference between ×1 and ×2 seemingly being smaller than the difference between ×8 and ×16), a significant discovery of the present invention is that this impression is frequently incorrect. In terms of delay, the difference between ×1 and ×2 can be significantly larger than the difference between ×8 and ×16. Thus provision of fine drive strength resolution for small drive strength cells (e.g., ×0.8, ×1.3 etc.) can significantly improve design.

Note that the number of cell drive strengths for a particular function in a cell library cannot be unduly increased without adverse consequences. As the number of cell drive strengths increases, the cell library database size increases, as does the number of alternatives that are considered in the course of an automated design. Both of these factors tend to increase design time. For this reason, it is impractical to reduce quantization induced over-design by simply providing a very large number of cell drive strengths (e.g., a conventional ×1, ×2, etc. approach where the ×1 cell is very small and typical cells are often ×10 or more). Thus an important advantage of the invention is provision of a cell library having cells more appropriately and efficiently sized than cells in conventional libraries.

FIG. 6 a shows delay vs. drive strength at a nominal load. The curve of FIG. 6 a can be regarded as the delay vs. drive strength curve along line 104 of FIG. 1. Plots as shown in FIG. 6 a provide an alternative viewpoint for appreciating aspects of the present invention. For example, FIG. 6 b shows delay vs. drive strength for library cells in accordance with an embodiment of the invention. In the example of FIG. 6 b, predetermined design delays 602, 604, 606, 608, 610, and 612 are defined having values of 160, 140, 120, 100, 80, and 60 ps respectively, and cell drive strengths are selected to provide delays about equal to these design delays. In the example of FIG. 6 b, the corresponding drive strengths are about ×0.5, ×0.65, ×0.8, ×1.0, ×1.2, and ×1.5 respectively.

The example of FIG. 6 b shows one way of selecting predetermined design delays, namely selecting the predetermined design delays to be equally spaced within a certain range. Preferably this range includes, as shown in FIG. 6 b, commonly used drive strengths. In some cases, not all of the design delays will be equally spaced, since it may be useful to include in a cell library a largest cell for a function, for use in rare cases where no smaller cell suffices, and in such cases, there is no particular reason to enforce a relation between the delay of such a largest cell and the other cell delays. Alternatively, as indicated in connection with FIG. 4, it may be desirable to select design delays that are more closely spaced for cells having smaller drive strength than for cells having larger drive strength. Another alternative for design delay selection relies on selection of design delays according to delay variation. For example, design delays selected to provide low delay variation are suitable for cells (e.g., buffers and inverters) for chip-level clock distribution, since low delay and, more importantly, low delay variation are required for clock distribution.

A noteworthy feature of the preceding examples of embodiments of the invention is that cells having fractional drive strength are employed. For example, cells according to the invention can have drive strengths of ×0.7, ×0.9, ×1.2, etc., as opposed to conventional integral drive strengths limited to ×1, ×2, ×3, etc. Note that these drive strengths, as indicated above, refer to active area, such that an ×2 cell has twice the active area of an ×1 cell, and an ×1.2 cell has 1.2 times the active area of an ×1 cell, etc. However, cell active area and cell physical area do not necessarily scale together, and so in practicing the invention, various choices can be made in relating cell physical size to cell drive strength.

One approach is to minimize cell physical area, which will tend to result in each cell having a different physical area. Furthermore, these physical cell areas will not have any simple relation among them. Recall that conventional cell libraries typically provide an ×2 cell that has the configuration of two ×1 cells side by side (and similarly for larger cells), which significantly simplifies layout. A set of cells having fractional drive strength and minimal physical area for each drive strength will generally not have areas which are multiples of a unit area. Such cells will tend to complicate layout, and accordingly minimization of physical area for each cell drive strength is not a preferred approach for practicing the invention.

Instead, it is preferred to retain the layout simplicity provided by conventional cell libraries having a relatively small number of distinct cell physical areas. Suppose, for example, that drive strengths ×0.6, ×0.7, ×1.2, and ×1.5 are desired in a particular cell library according to the invention. If the ×0.6 and ×0.7 cells have the same physical area A1 and the ×1.2 and 1.5 cells both have area A2, the desired layout simplicity will be obtained, since 4 cells have only 2 different areas. Preferably, these cells also have one dimension being the same (e.g., either width or height). There are various ways to ensure this. An ×1 and an ×2 cell can be designed having physical areas A1 and A2 respectively. Scaling of transistor sizes within the ×1 cell, without changing cell physical area, can then be used to obtain the ×0.6 and ×0.7 cells. Similarly, scaling of transistor sizes within the ×2 cell, without changing cell physical area, can be used to obtain the ×1.2 and ×1.5 cells. A more area-efficient alternative for this particular example is to design ×0.75 and ×1.5 cells having physical areas A1 and A2 respectively. Thus cells having a relatively large number of different drive strengths preferably have a relatively small number of predetermined cell physical areas.

FIGS. 7 a and 7 b show an ×0.5 cell 702 and an ×1.5 cell 710 in accordance with this preferred cell physical sizing approach. The ×0.5 cell 702 of FIG. 7 a has the same physical area as the ×1 cell 202 of FIG. 2 a, and the ×1.5 cell 710 of FIG. 7 b has the same physical area as the ×2 cell 210 of FIG. 2 b. Cell 702 has a gate contact 706 between a source 704 and a drain 708. The active area of cell 702 is half the active area of cell 202 because source 704 and drain 708 have half the width of source 204 and 208 respectively. Cell 710 has a gate contact 714 between a source 712 and a drain 716. The active area of cell 710 is ¾ the active area of cell 210 because source 712 and drain 716 have ¾ the width of source 212 and 216 respectively.

When cells according to the invention and having different drive strengths are designed to have the same physical area, as in FIGS. 2 a-b and 7 a-b, then some cells will typically have non-minimal physical area (e.g., the 0.5× and 1.5× cells of FIGS. 7 a and 7 b). Cells having non-minimal physical area are usually not present in conventional cell libraries, since minimization of circuit physical area is often assumed to be a paramount design consideration. Thus, one of the discoveries of the present invention is that use of cells having non-minimal physical area can provide compensating advantages, such as ease of layout in combination with reduced quantization induced over-design.

In a preferred embodiment, a cell library of the present invention also provides active region geometrical parameter information for each cell. Use of such active region information is discussed in detail by the present inventors in a co-pending US patent application entitled “Automatic Circuit Design Method with a Cell Library Providing Transistor Size Information” filed on even date herewith and hereby incorporated by reference in its entirety. A cell library providing both active region geometrical information and having cell drive strengths selected according to delay is highly advantageous for circuit design. Provision of cells having drive strength selected according to delay reduces quantization induced over-design, and provision of active region information allows powerful automated design tools to efficiently select lower drive strength cells where appropriate.

For example, in a typical circuit design, roughly 20% of the circuit paths are timing critical, and the remaining 80% of the paths are not timing critical. Conventional cell libraries typically provide ×1 as the smallest cell drive strength. Cells in paths which are not timing critical are usually the slowest available cells (e.g., ×1 cells), leading to designs which typically have a very large fraction of ×1 cells. Thus, over-design is commonplace in circuit paths which are not timing critical, since many ×1 cells could be replaced by slower cells.

However, mere provision of slower cells having reduced power consumption in a cell library does not enable an automated design tool to efficiently utilize such cells, since power and timing analysis is typically too time-consuming to perform on non critical parts of the circuit. Thus, the active region information of this preferred embodiment enables efficient automatic utilization of slow, low-power cells by providing one or more simple parameters to the automatic design tool that correlate well with power consumption and speed. This approach can be regarded as an efficient optimization of the 80% or so of typical circuit paths that are not timing critical.

The preceding description discusses cell libraries and methods for constructing a cell library (by selecting library cell drive strengths) according to the invention. Such cell libraries can be embodied as a database on a computer-readable medium, such as a magnetic or optical disk. Accordingly, a set of computer instructions recorded on a computer-readable medium that provide a cell library as discussed above is also an embodiment of the invention. 

1. A cell library for use in automatic design of a large scale circuit: wherein the library comprises a plurality of cells providing a plurality of cell functions; wherein each of said cell functions has a corresponding nominal cell function load; wherein each of said cell functions is provided by three or more of said cells, each having a drive strength and a delay at said nominal cell function load depending on said drive strength; wherein said delays are substantially equal to a set of predetermined design delays for each of said cell functions; wherein a design delay spacing is defined for each pair of said cells performing the same one of said cell functions and having adjacent delays by taking the difference of the delays of the two cells of said pair; wherein the design delay spacings corresponding to at least one of said cell functions do not decrease as drive strength increases.
 2. The cell library of claim 1, wherein three or more of said predetermined design delays are about evenly spaced in time.
 3. The cell library of claim 2, wherein said evenly spaced design delays are spaced apart by about 10 picoseconds.
 4. The cell library of claim 1, wherein each of said cell functions is provided by one or more of said cells having physical areas substantially equal to a predetermined set of physical areas.
 5. A method for constructing a cell library for use in automatic design of a large scale circuit, the method comprising: a) providing a plurality of cell functions, each having a corresponding nominal cell function load; b) providing, for each of said cell functions, three or more cells, each having a drive strength and a delay at said nominal cell function load depending on said drive strength; and c) selecting said delays to be substantially equal to a set of predetermined design delays for each of said cell functions; wherein a design delay spacing is defined for each pair of said cells performing the same one of said cell functions and having adjacent delays by taking the difference of the delays of the two cells of said pair; wherein the design delay spacings corresponding to at least one of said cell functions do not decrease as drive strength increases.
 6. The method of claim 5, wherein three or more of said predetermined design delays are about evenly spaced in time.
 7. The method of claim 6, wherein said evenly spaced design delays are spaced apart by about 10 picoseconds.
 8. The method of claim 5, wherein each of said cell functions is provided by one or more of said cells having physical areas substantially equal to a predetermined set of physical areas.
 9. A set of computer instructions recorded on and executable from a computer-readable medium for providing a cell library for automatic design of a large scale circuit: wherein the cell library comprises a plurality of cells providing a plurality of cell functions; wherein each of said cell functions has a corresponding nominal cell function load; wherein each of said cell functions is provided by three or more of said cells, each having a drive strength and a delay at said nominal cell function load depending on said drive strength; wherein said delays are substantially equal to a set of predetermined design delays for each of said cell functions; wherein a design delay spacing is defined for each pair of said cells performing the same one of said cell functions and having adjacent delays by taking the difference of the delays of the two cells of said pair; wherein the design delay spacings corresponding to at least one of said cell functions do not decrease as drive strength increases.
 10. The set of computer instructions of claim 9, wherein three or more of said predetermined design delays are about evenly spaced in time.
 11. The set of computer instructions of claim 10, wherein said evenly spaced design delays are spaced apart by about 10 picoseconds.
 12. The set of computer instructions of claim 9, wherein each of said cell functions is provided by one or more of said cells having physical areas substantially equal to a predetermined set of physical areas. 